Stochastic programming

Results: 538



#Item
221Dynamic programming / Stochastic control / Markov decision process / Reinforcement learning / Markov chain / Valuation / Secretary problem / Statistics / Markov processes / Markov models

Better Be Lucky Than Good: Exceeding Expectations in MDP Evaluation Thomas Keller and Florian Geißer University of Freiburg Freiburg, Germany {tkeller,geisserf}@informatik.uni-freiburg.de

Add to Reading List

Source URL: gki.informatik.uni-freiburg.de

Language: English - Date: 2015-03-07 08:48:57
222Stochastic control / Control theory / Search algorithms / Partially observable Markov decision process / Markov decision process / Tree traversal / Game theory / Statistics / Dynamic programming / Markov processes

Past, Present, and Future: An Optimal Online Algorithm for Single-Player GDL-II Games 1 ¨ Florian Geißer and Thomas Keller and Robert Mattmuller

Add to Reading List

Source URL: gki.informatik.uni-freiburg.de

Language: English - Date: 2014-08-08 08:44:39
223Systems theory / Dynamic programming / Equations / Operations research / Stochastic control / Lipschitz continuity / Optimal control / S0 / Markov decision process / Statistics / Mathematical optimization / Control theory

Sample Complexity and Performance Bounds for Non-parametric Approximate Linear Programming Jason Pazis and Ronald Parr Department of Computer Science, Duke University Durham, NC 27708 {jpazis,parr}@cs.duke.edu

Add to Reading List

Source URL: www.cs.duke.edu

Language: English - Date: 2013-04-10 17:56:12
224Markov processes / Stochastic control / Control theory / Partially observable Markov decision process / Valuation / Markov decision process / Algorithm / Function / Statistics / Mathematics / Dynamic programming

Point-Based Policy Iteration Shihao Ji, Ronald Parr† , Hui Li, Xuejun Liao, and Lawrence Carin Department of Electrical and Computer Engineering † Department of Computer Science Duke University

Add to Reading List

Source URL: www.cs.duke.edu

Language: English - Date: 2007-04-27 18:47:12
225Dynamic programming / Stochastic control / Mathematical optimization / Markov processes / Numerical analysis / Reinforcement learning / Markov decision process / Approximation / Least squares / Statistics / Mathematical analysis / Mathematics

Journal of Machine Learning Research[removed]1149 Submitted 8/02; Published[removed]Least-Squares Policy Iteration Michail G. Lagoudakis

Add to Reading List

Source URL: www.cs.duke.edu

Language: English - Date: 2003-12-20 17:38:33
226Dynamic programming / Stochastic control / Reinforcement learning / Markov decision process / Q-learning / Machine learning / Finite-state machine / Markov chain / Algorithm / Statistics / Markov processes / Markov models

Reinforcement Learning with Hierarchies of Machines Ronald Parr and Stuart Russell Computer Science Division, UC Berkeley, CA[removed]parr,russell @cs.berkeley.edu

Add to Reading List

Source URL: www.cs.duke.edu

Language: English - Date: 2012-05-29 14:54:19
227Dynamic programming / Markov processes / Stochastic control / Reinforcement learning / Support vector machine / Markov decision process / Supervised learning / Temporal difference learning / Policy / Statistics / Machine learning / Statistical classification

Reinforcement Learning as Classification: Leveraging Modern Classifiers Michail G. Lagoudakis Ronald Parr Department of Computer Science, Duke University, Durham, NC[removed]USA

Add to Reading List

Source URL: www.cs.duke.edu

Language: English - Date: 2003-12-20 17:25:46
228Markov models / Stochastic control / Markov decision process / Reinforcement learning / Bellman equation / Markov chain / Partially observable Markov decision process / Statistics / Dynamic programming / Markov processes

Modular Value Iteration Through Regional Decomposition Linus Gisslen, Mark Ring, Matthew Luciw, and J¨ urgen Schmidhuber IDSIA Manno-Lugano, 6928, Switzerland

Add to Reading List

Source URL: agi-conference.org

Language: English - Date: 2012-12-09 10:12:52
229Applied mathematics / Mathematical optimization / Dynamic programming / Markov processes / Stochastic control / Reinforcement learning / Markov decision process / Function / Algorithm / Mathematics / Statistics / Operations research

Journal of Artificial Intelligence Research[removed]468 Submitted 1/02; published[removed]Efficient Solution Algorithms for Factored MDPs Carlos Guestrin

Add to Reading List

Source URL: www.cs.cmu.edu

Language: English - Date: 2003-10-30 16:25:03
230Stochastic control / S0 / Markov decision process / Planning Domain Definition Language / Reinforcement learning / Automated planning and scheduling / Bellman equation / Temporal difference learning / Tree / Statistics / Dynamic programming / Markov processes

PROST: Probabilistic Planning Based on UCT Thomas Keller and Patrick Eyerich Albert-Ludwigs-Universit¨at Freiburg Institut f¨ur Informatik Georges-K¨ohler-Allee[removed]Freiburg, Germany

Add to Reading List

Source URL: gki.informatik.uni-freiburg.de

Language: English - Date: 2012-05-24 10:56:58
UPDATE